Modeling and Analysis of Availability of Datacenter Power Infrastructure

Authors

  • Sriram Govindan
  • Di Wang
  • Lydia Chen
  • Anand Sivasubramaniam
  • Bhuvan Urgaonkar
Abstract

Realizing highly available datacenter power infrastructure is an extremely expensive proposition, with costs more than doubling as we move from three 9’s (Tier-1) to six 9’s (Tier-4) of availability. Existing approaches only consider the cost/availability trade-off for a restricted set of power infrastructure configurations, relying mainly on component redundancy. A number of additional knobs also exist, such as centralized vs. distributed component placement, power-feed interconnect topology, and component capacity over-provisioning, whose impact has only been studied in limited forms. In this paper, we provide a systematic approach to understand the cost/availability trade-off offered by these configuration parameters as a function of supported IT load. We develop detailed datacenter availability models using Continuous-time Markov Chains and Reliability Block Diagrams to quantify the relative impact of these parameters on availability. Using real-world component availability data to parametrize these models, we offer a number of interesting insights into developing cost-effective yet highly available power infrastructure. As two salient examples, we find that (i) although centralized UPS placement offers high availability, it does so at significant cost, and (ii) distributed server-level UPS placement is much more cost-effective but does not offer meaningful availability for operating the datacenter at full load. Based on these insights, we propose a novel hybrid strategy that combines server-level UPS placement with a rack-level UPS, achieving availability as good as existing centralized techniques at just two-thirds of their cost.
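
The abstract names two modeling tools, Continuous-time Markov Chains and Reliability Block Diagrams, without showing how they produce an availability figure. The following is a minimal sketch of the standard textbook formulas only, not the paper's models: it assumes independent component failures, a two-state (up/down) Markov chain per component, and purely illustrative MTTF/MTTR numbers that do not come from the paper. All function names are hypothetical.

```python
import math


def component_availability(mttf_hours, mttr_hours):
    """Steady-state availability of a two-state (up/down) Markov chain.

    With failure rate 1/MTTF and repair rate 1/MTTR, the long-run
    fraction of time spent in the "up" state is MTTF / (MTTF + MTTR).
    """
    return mttf_hours / (mttf_hours + mttr_hours)


def series(*availabilities):
    """Series block: the path works only if every component works."""
    result = 1.0
    for a in availabilities:
        result *= a
    return result


def parallel(*availabilities):
    """Parallel (redundant) block: fails only if every component fails."""
    all_down = 1.0
    for a in availabilities:
        all_down *= (1.0 - a)
    return 1.0 - all_down


def nines(availability):
    """Express availability as a 'number of 9's' (0.999 is about 3)."""
    return -math.log10(1.0 - availability)


if __name__ == "__main__":
    # Illustrative numbers only (assumed, not taken from the paper).
    ups = component_availability(mttf_hours=999.0, mttr_hours=1.0)
    single_path = series(ups, ups)          # two components in a row
    redundant = parallel(ups, ups)          # two components side by side
    print(f"single UPS : {ups:.6f} ({nines(ups):.2f} nines)")
    print(f"series     : {single_path:.6f} ({nines(single_path):.2f} nines)")
    print(f"redundant  : {redundant:.6f} ({nines(redundant):.2f} nines)")
```

Under these assumptions, placing components in series lowers availability while redundancy raises it, which is the basic effect behind the redundancy and placement knobs the abstract discusses.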

Similar resources

Performance Modeling of Power Generation System of a Thermal Plant

The present paper discusses the development of a performance model of the power generation system of a thermal plant for performance evaluation, using the Markov technique and a probabilistic approach. The study covers two areas: development of a predictive model and evaluation of performance with the help of the developed model. The system of the thermal plant under study consists of four subsystems with...

Failure Mode and Effect Analysis Power Plant Boiler

Electricity demand is increasing, and the government has now involved third parties in electricity provision, so investors compete to build infrastructure to supply electricity. Thermal power is one source that reaches its break-even point faster than other resources, which makes it more attractive to investors despite the pollution it causes. A form of heat pow...

Analysis of Methods for Providing Availability and Accessibility of Cloud Services

The article describes methods for dealing with reliability and fault tolerance issues of cloud datacenters. These methods are mainly focused on the elimination of single point of failure within any component of the cloud infrastructure, including the availability of infrastructure and accessibility of cloud services. The methods for providing the availability of hardware, software and network c...

Reliability and Availability Analysis of Fusion Power Plants

Major efforts are underway to develop fusion energy for use in electric power production in the future. While fusion reactor concepts are being developed, appropriate attention must be given to problems relevant to the utility requirements which are likely to be encountered in the commercialization phase. In this paper the expected fusion plant availability is assessed in detail due to the impo...

Modelo de gestão de predição de falhas no gerenciamento da infraestrutura de datacenter

This paper proposes a management model for predicting failures in datacenter infrastructure management, based on good practices for managing IT (Information Technology) infrastructure. The model refers to a system that monitors the datacenter and detects faults in legacy equipment, thereby preventing their collapse. Based on the ITIL library, but focusing only on the most relevant module...

Publication date: 2010